NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

NotebookOS: A Replicated Notebook Platform for Interactive Training with On-Demand GPUs

Carver, Benjamin; Zhang, Jingyuan; Wang, Haoliang; Mahadik, Kanak; Cheng, Yue (March 2026, The ACM International Conference on Architectural Support for Programming Languages and Operating Systems)

Interactive notebook programming is universal in modern ML and AI workflows, with interactive deep learning training (IDLT) emerging as a dominant use case. To ensure responsiveness, platforms like Jupyter and Colab reserve GPUs for long-running notebook sessions, despite their intermittent and sporadic GPU usage, leading to extremely low GPU utilization and prohibitively high costs. In this paper, we introduce NotebookOS, a GPU-efficient notebook platform tailored for the unique requirements of IDLT. NotebookOS employs replicated notebook kernels with Raft-synchronized replicas distributed across GPU servers. To optimize GPU utilization, NotebookOS oversubscribes server resources, leveraging high inter-arrival times in IDLT workloads, and allocates GPUs only during active cell execution. It also supports replica migration and automatic cluster scaling under high load. Altogether, this design enables interactive training with minimal delay. In evaluation on production workloads, NotebookOS saved over 1,187 GPU hours in 17.5 hours of real-world IDLT, while significantly improving interactivity.
more » « less
Free, publicly-accessible full text available March 22, 2027
Implementation and Evaluation of IEEE 802.11ax Channel Sounding Frame Exchange in ns-3

https://doi.org/10.1145/3592149.3592152

Zhang, Jingyuan; Avallone, Stefano; Blough, Douglas M. (June 2023, ACM)

Full Text Available
λFS: A Scalable and Elastic Distributed File System Metadata Service using Serverless Functions

https://doi.org/10.1145/3623278.3624765

Carver, Benjamin; Han, Runzhou; Zhang, Jingyuan; Zheng, Mai; Cheng, Yue (March 2023, ACM)

Full Text Available
SION: Elastic Serverless Cloud Storage

Zhang, Jingyuan; Wang, Ao; Ma, Xiaolong; Carver, Benjamin; Newman, Nicholas; Anwar, Ali; Rupprecht, Lukas; Skourtis, Dimitrios; Tarasov, Vasily; Yan, Feng; et al (August 2023, International Conference on Very Large Data Bases (VLDB 2023))

Full Text Available
InfiniStore: Elastic Serverless Cloud Storage

https://doi.org/10.14778/3587136.3587139

Zhang, Jingyuan; Wang, Ao; Ma, Xiaolong; Carver, Benjamin; Newman, Nicholas John; Anwar, Ali; Rupprecht, Lukas; Tarasov, Vasily; Skourtis, Dimitrios; Yan, Feng; et al (March 2023, Proceedings of the VLDB Endowment)

Cloud object storage such as AWS S3 is cost-effective and highly elastic but relatively slow, while high-performance cloud storage such as AWS ElastiCache is expensive and provides limited elasticity. We present a new cloud storage service called ServerlessMemory, which stores data using the memory of serverless functions. ServerlessMemory employs a sliding-window-based memory management strategy inspired by the garbage collection mechanisms used in the programming language to effectively segregate hot/cold data and provides fine-grained elasticity, good performance, and a pay-per-access cost model with extremely low cost. We then design and implement InfiniStore, a persistent and elastic cloud storage system, which seamlessly couples the function-based ServerlessMemory layer with a persistent, inexpensive cloud object store layer. InfiniStore enables durability despite function failures using a fast parallel recovery scheme built on the auto-scaling functionality of a FaaS (Function-as-a-Service) platform. We evaluate InfiniStore extensively using both microbenchmarking and two real-world applications. Results show that InfiniStore has more performance benefits for objects larger than 10 MB compared to AWS ElastiCache and Anna, and InfiniStore achieves 26.25% and 97.24% tenant-side cost reduction compared to InfiniCache and ElastiCache, respectively.
more » « less
Full Text Available
Optimizing Coverage with Intelligent Surfaces for Indoor mmWave Networks

https://doi.org/10.1109/INFOCOM48880.2022.9796762

Zhang, Jingyuan; Blough, Douglas M. (January 2022, Proceedings of the IEEE Conference on Computer Communications)

Reconfigurable intelligent surfaces (RISs) have been proposed to increase coverage in millimeter-wave networks by providing an indirect path from transmitter to receiver when the line-of-sight (LoS) path is blocked. In this paper, the problem of optimizing the locations and orientations of multiple RISs is considered for the first time. An iterative coverage expansion algorithm based on gradient descent is proposed for indoor scenarios where obstacles are present. The goal of this algorithm is to maximize coverage within the shadowed regions where there is no LoS path to the access point. The algorithm is guaranteed to converge to a local coverage maximum and is combined with an intelligent initialization procedure to improve the performance and efficiency of the approach. Numerical results demonstrate that, in dense obstacle environments, the proposed algorithm doubles coverage compared to a solution without RISs and provides about a 10% coverage increase compared to a brute force sequential RIS placement approach.
more » « less
Full Text Available
Brassinosteroid gene regulatory networks at cellular resolution in the Arabidopsis root

https://doi.org/10.1126/science.adf4721

Nolan, Trevor M.; Vukašinović, Nemanja; Hsu, Che-Wei; Zhang, Jingyuan; Vanhoutte, Isabelle; Shahan, Rachel; Taylor, Isaiah W.; Greenstreet, Laura; Heitz, Matthieu; Afanassiev, Anton; et al (March 2023, Science)

Brassinosteroids are plant steroid hormones that regulate diverse processes, such as cell division and cell elongation, through gene regulatory networks that vary in space and time. By using time series single-cell RNA sequencing to profile brassinosteroid-responsive gene expression specific to different cell types and developmental stages of theArabidopsisroot, we identified the elongating cortex as a site where brassinosteroids trigger a shift from proliferation to elongation associated with increased expression of cell wall–related genes. Our analysis revealedHOMEOBOX FROM ARABIDOPSIS THALIANA 7(HAT7) andGT-2-LIKE 1(GTL1) as brassinosteroid-responsive transcription factors that regulate cortex cell elongation. These results establish the cortex as a site of brassinosteroid-mediated growth and unveil a brassinosteroid signaling network regulating the transition from proliferation to elongation, which illuminates aspects of spatiotemporal hormone responses.
more » « less
Full Text Available
Wukong: A Scalable and Locality-Enhanced Framework for Serverless Parallel Computing

https://doi.org/10.1145/3419111.3421286

Carver, Benjamin; Zhang, Jingyuan; Wang, Ao; Anwar, Ali; Wu, Panruo; Cheng, Yue (October 2020, ACM Symposium on Cloud Computing 2020 (SoCC '20))
null (Ed.)
Executing complex, burst-parallel, directed acyclic graph (DAG) jobs poses a major challenge for serverless execution frameworks, which will need to rapidly scale and schedule tasks at high throughput, while minimizing data movement across tasks. We demonstrate that, for serverless parallel computations, decentralized scheduling enables scheduling to be distributed across Lambda executors that can schedule tasks in parallel, and brings multiple benefits, including enhanced data locality, reduced network I/Os, automatic resource elasticity, and improved cost effectiveness. We describe the implementation and deployment of our new serverless parallel framework, called Wukong, on AWS Lambda. We show that Wukong achieves near-ideal scalability, executes parallel computation jobs up to 68.17X faster, reduces network I/O by multiple orders of magnitude, and achieves 92.96% tenant-side cost savings compared to numpywren.
more » « less
Full Text Available
In Search of a Fast and Efficient Serverless DAG Engine

https://doi.org/10.1109/PDSW49588.2019.00005

Carver, Benjamin; Zhang, Jingyuan; Wang, Ao; Cheng, Yue (November 2019, 2019 IEEE/ACM Fourth International Parallel Data Systems Workshop (PDSW))

Full Text Available
InfiniCache: Exploiting Ephemeral Serverless Functions to Build a Cost-Effective Memory Cache

Wang, Ao; Zhang, Jingyuan; Ma, Xiaolong; Anwar, Ali; Rupprecht, Lukas; Skourtis, Dimitrios; Tarasov, Vasily; Yan, Feng; Cheng, Yue (February 2020, 18th USENIX Conference on File and Storage Technologies)

Internet-scale web applications are becoming increasingly storage-intensive and rely heavily on in-memory object caching to attain required I/O performance. We argue that the emerging serverless computing paradigm provides a well-suited, cost-effective platform for object caching. We present InfiniCache, a first-of-its-kind in-memory object caching system that is completely built and deployed atop ephemeral serverless functions. InfiniCache exploits and orchestrates serverless functions' memory resources to enable elastic pay-per-use caching. InfiniCache's design combines erasure coding, intelligent billed duration control, and an efficient data backup mechanism to maximize data availability and cost-effectiveness while balancing the risk of losing cached state and performance. We implement InfiniCache on AWS Lambda and show that it: (1) achieves 31 – 96× tenant-side cost savings compared to AWS ElastiCache for a large-object-only production workload, (2) can effectively provide 95.4% data availability for each one hour window, and (3) enables comparative performance seen in a typical in-memory cache.
more » « less
Full Text Available

« Prev Next »

Search for: All records